AWS Pyspark MDM Developer

Delivery South San Francisco, California


Description

  • Does solving complex business problems and real world challenges interest you? Do you enjoy seeing the impact your contributions make on a daily basis? Are you passionate about using data analytics to provide game changing solutions to the Global 2000 clients? Do you thrive in a dynamic work environment that constantly pushes you to be the best you can be and more? Are you ready to work with smart colleagues who drive for excellence in everything they do? If you possess a solutions mindset, strong architectingskills, and commitment to be part of a tremendous journey, come join our growing, global team. See what Saama can do for your career and for your journey.

     
    Saama Analytics has been on the forefront of data innovation for the last two decades and continues to offer cutting-edge data analytics solutions powered with big data, cloud, and AI/ML aptitudes for its customers in Life Sciences, Insurance, CPG, and other industries. Saama is committed to finding the best people because the innovations and discoveries that enabled together leads to better technologies, better treatments, and a better future.
     
    Responsibilities:
    • Lead the design, implementation, and optimization of scalable data pipelines and architectures utilizing AWS Glue, Elastic MapReduce (EMR), Lambda, Redshift, Athena, DynamoDB, OpenSearch, and S3.
    • Use Spark on AWS for data transformation and processing across large datasets.
    • Develop and maintain efficient data workflows with SQS for task queueing and orchestration.
    • Integrate, transform, and manage data using Mulesoft for seamless data integration.
    • Ensure high-performance data storage, retrieval, and analytics across Redshift, DynamoDB, and Athena.
    • Oversee data consistency, integrity, and compliance through IQVIA MDM solutions.
    • Apply best practices in data governance, security, and scalability within a collaborative and cross-functional team environment.

    Qualifications:
    • Proven expertise in AWS data engineering, specifically with Glue, EMR, Lambda, Redshift, Athena, DynamoDB, OpenSearch, and S3.
    • Some experience with data integration (Mulesoft, Talend)
    • Working knowledge of master data management.
    • Demonstrated ability to lead technical projects and mentor data engineering teams.
    • Exceptional analytical and communication skills.
     
    Work Environment
    This job operates in a professional remote office environment. This role routinely uses standard office equipment, including but not limited to, computers, phones, and photocopiers.
    Physical Demands
    This position requires the frequent and repetitive use of a computer, keyboard, and mouse. Hand and finger dexterity is required.
    Other Duties
    Please note that this job description is not designed to cover or contain a comprehensive listing of activities, duties, or responsibilities required of the employee for this job. Duties, responsibilities, and activities may change at any time, with or without notice.
    EEO 
    Saama Technologies, Inc. provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws.
    This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training.